Physical Equivalence of Pure States and Derivation of Qubit in General Probabilistic 

Theories 
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In this paper, we investigate a characterization of Quantum Mechanics by two physical principles 
based on general probabilistic theories. We first give the operationally motivated definition of the 
physical equivalence of states and consider the principle of the physical equivalence of pure states, 
which turns out to be equivalent to the symmetric structure of the state space. We further consider 
another principle of the decomposability with distinguishable pure states. We give classification 
theorems of the state spaces for each principle, and derive the Bloch ball in 2 and 3 dimensional 
systems by these principles. 



I. INTRODUCTION 



After the completion of von Neumann's celebrated ax- 
ioms [l[ of Quantum Mechanics (QM), one of the theo- 
retical and philosophical interests of QM shifted to de- 
rive the theory using solely physical principles. Here the 
physical principle means a physical statement which is, 
in principle, testable by experiments. For instance, a 
statement such as "a quantum observable is represented 
by a self- adjoint operator on a Hilbcrt space" is not a 
physical principle, but a statement such as "the speed of 
light is constant independent of observer's motion" gives 
a typical example of a physical principle. Indeed, the 
standard axioms of QM presuppose a priori mathemati- 
cal structures such as Hilbert space and linear operators 
- as the former example is one of the axioms — and 
then gives a recipe (Born's rule) of how to predict physi- 
cal phenomena by combining such mathematical objects. 
This non-physical characterization of QM is sometimes 
considered as one of the causes why QM is difficult to 
intuitively understand — in the end, many students who 
learns QM for the first time naively ask themselves "Why 
observables are described by (non-commutative) opera- 
tors?" On the other hand, if QM was constructed solely 
with physical principles and if, moreover, they are tested 
by experiments, then QM would be intuitively more ac- 
ceptable than the present situation, as all the "weird" 
quantum phenomena (such as the uncertainty principle 
and an entanglement) automatically follow from exper- 
imentally established phenomena. (For instance, most 
physicists feel to intuitively understand the relativity of 
time, even though contrary to our common sense, since 
they know that the invariance of the speed of light is ex- 
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perimentally confirmed and the fact naturally leads the 
relativity of time.) 

Therefore, a desire to derive axioms of QM which are 
purely described by physical principles is natural as many 
researchers indeed have tackled this problem so far &- 
0, and some of recent developments in }c^-[l0| and [ll| 
contribute the goal in each ways. 

One of the lines of this research starts by recogniz- 
ing that QM is one of the probabilistic theories. Indeed, 
with the standard interpretation of QM and indeed with 
a practical method to compare with experimental data, 
what QM predicts is a probability to obtain an outcome 
by performing a measurement under a given state. How- 
ever, the structure of the theory is more complicated than 
a classical probability theory (based on Kolmogorov's ax- 
ioms). Therefore, it is convenient to first establish a gen- 
eral framework for all the possible probabilistic models 
including both classical theory and QM. Then, we can 
look for particular conditions (described by physical prin- 
ciples) which narrow down the general models to be QM. 
Indeed, such a general framework has been well inves- 
tigated (for instance see [1343) and rece ntly is called 
General Probabilistic Theories (GPTs) [I3-[l3- 

For this purpose, we first give an operationally mo- 
tivated definition of the physical equivalence between 
states and propose a physical principle of [P5] Physical 
Equivalence of Pure States. Then we show that the prin- 
ciple is equivalently described by a symmetric structure 
of state space (See Theorem [2] and the principle [P5']). 
Next, we consider another principle of [P6] the decom- 
posability with distinguishable pure states. We give rep- 
resentation theorems for state spaces for each principle 
(See Proposition [5j Theorems [3 [6] for [P5], and The- 
orem [7] for [P6]). By combining these results, we de- 
rive that GPTs with 2 or 3 dimensional state spaces are 
either classical or quantum (See Theorem [5]). Our re- 
sults are closely related to those in (cMIoj and , where 
the general Bloch ball is obtained. However, the defini- 
tion of the Bloch ball there is given by a system with at 
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most two distinguishable states. On the other hand, we 
do not require the maximum number of distinguishable 
states and discuss how the state space is characterized. 
To understand this non-triviality, we notice that a state 
space is not uniquely determined under the above two 
principles but on the contrary still there admits a com- 
plicated state space, where the typical example is given 
by QM itself with more than or equal to 3 dimensional 
Hilbcrt spaces (see [13] )■ Notice that in this paper we do 
not deal with a dynamics, a measurement process, nor a 
composite system, which are another important notions 
for GPTs. Rather, we are interested in how GPTs are 
characterized with the above minimum restrictions (prin- 
ciples) posed only on states and measurements. 

This paper is organized as follows. In Sec. HH we 
review and reconstruct GPTs with natural physical prin- 
ciples [P1-P3], especially explaining the basic notions of 
states and measurements and how mathematical struc- 
tures such as vectors in a real vector space appear to 
represent them. In Sec. IIII1 we introduce and investigate 
the principles [P5] and [P6], including general discussion 
of the invariant states under any affine bijection on the 
state space. In Sec. IIV[ we give a concluding remark and 
discussion. 



II. REVIEW OF GENERAL PROBABILISTIC 
THEORIES 

In this section, we briefly review General Probabilistic 
Theories (GPTs) as a general framework of operationally 
well-defined probabilistic models, focusing on the notions 
of states and measurements. Based on operationally mo- 
tivated principles, we explain how states and measure- 
ments arc mathematically represented by vectors in a 
compact convex set and effects, respectively. It is im- 
portant to recognize that these mathematical structures 
are not introduced a priori but they are the consequences 
from physical principles. 

A. States and Measurements on GPTs 

The basic ingredients of GPTs are the notions of states 
and measurements with a physical law (e.g., Born's Rule 
in quantum mechanics) to predict a probability to ob- 
tain a measurement outcome. Formally, wc assume the 
following: 

[PI] (Probability Assignment Principle) A probability 
Pr{M = m;|s} to get an outcome mi is given when per- 
forming a measurement M under a state s: 

n 

Pr{M = mi\s} > 0,^Pr{A/ = m t \s} = 1. (1) 
i=i 

In this paper, we assume for mathematical simplicity 
that each measurement M has a discrete (and finite) 
set of measurement outcomes {m.i}?=i with n > 1 [3^ ]. 



In particular, 2-valued measurements with n = 2 (Yes- 
No measurements) play a fundamental role as building 
blocks of any measurements (See for instance Chapter 
6 in [ll|). In the following, the symbols s, si,S2, 
M, Mi, M2, • • • and m, mi, TO2, ■ ■ ■ represent states, mea- 
surements and measurement outcomes, respectively. 
With an operational view, we naturally identify states 

51 and S2 if there are no physical differences between 
them. In our case, the only method to compare physi- 
cal properties of states is to observe probabilities given 
through possible measurements. Thus we may assume 
the following: 

[P2-1] (Separation Principles for States) States s\ and 

52 are identified iff Pr{M = m.;|si} = Pr{M = m^^} 
for any measurement M and measurement outcome m,i . 

Indeed, the preliminary notion of "state" in an oper- 
ational view is defined by a preparation of experimental 
instruments (settings). Then, a state is defined by the 
equivalence class among all the "states" having the same 
physical properties, which in our case are the probability 
distributions among possible measurements. Therefore, 
one may interpret [P2-1] as one aspect of a natural defi- 
nition of states. 

The similar argument follows for measurements. 
Namely, we identify measurements if there are no phys- 
ical differences among them under any states. It is also 
convenient to drop the information of measurement out- 
come mi by identifying them just as an zth outcome. By 
doing so, we denote by Pr{M = i\s} := Pr{M = 77ii|s} 
the probability to get ith outcome when performing a 
measurement M under a state s. Thus we identify n- 
valued measurements M and M' irrespective of the set 
of outcome if their statistical properties are identical for 
each outcome. Therefore, we assume the following: 

[P2-2] (Separation Principles for Measurements) Mea- 
surements Mi and Mi with the same number of outcomes 
are identified iffPv{M\ = i\s} = Pr{M2 = i\s} for any i 
and state s. 

It is operationally natural to allow a probabilistic mix- 
ture of states as one of the procedures to prepare states. 
Namely, if states Si and 82 can be prepared, a preparation 
of state si with probability p and state S2 with probabil- 
ity 1 — p can also be prepared. With this preparation, the 
probability for any measurement should satisfy the mix- 
ing property due to the union rule of the probability and 
the definition of the conditional probability. Formally, 
we thus assume the following: 

[P3-1] (Mixing Principle for States) For any state si, 
S2 and for any p £ [0, 1], there exists a state s satisfying 
Pr{M = i\s} = pPv{M = i\ Sl } + (1 - p)Pr{M = i\s 2 } 
for any measurement M and i. 

Of course, we implicitly suppose that the above state 
s can be prepared as a probabilistic mixture of s\ and S2 
with probabilities p and 1 —p. From [P2-1], it is uniquely 
determined with given s\, S2 and p, and is denoted by s = 
(p; si, S2) (Til ]. Note that the trivial mixtures s = (p; s, s) 
and s = (1; s, s') = (0; s', s) follow for any state s, s' and 
p € [0,1]. A state is called a pure state iff there exist no 
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means to prepare it with nontrivial probabilistic mixture. 
Namely, s is a pure state iff s = {p; s±, S2) for states s±, S2 
andp G (0, 1) implies si = S2- Otherwise, a state is called 
a mixed state. In the following, we denote the set of all 
states and pure states by S and Spure , respectively. S is 
called the state space. 

Similarly, it is operationally legitimate to allow a prob- 
abilistic mixture of measurements: Namely, if n-valued 
measurements Mi and M2 can be performed, then a mea- 
surement to perform Mi with probability p and M2 with 
probability 1 — p is also possible: 

[P3-2] (Mixing Principle for Measurements) For any n- 
valued measurement M\, M2, and for any p G [0, 1] there 
exists an n-valued measurement M satisfying Pr{M = 
i\s} = pPr{Mi = i\s} + (1 - p)Pt{AI 2 = i\s} for any 
i = 1, . . . , n and state s. 

From [P2-2], the measurement M with given Mi, A'h.p 
in [P3-2] is uniquely determined up to arbitrary choice of 
measurement outcomes. 

So far, we have only assumed natural principles [Pl]- 
[P3] resorting to no a priori mathematical structures. 
However, it turns out that states and measurements are 
represented by vectors in a real vector space as follows: 

[Representation 1] (i) A state is represented by a vec- 
tor in a real vector space V such that a convex combina- 
tion of states s\ and S2 with weight p represents the state 
(p; S\, S2); Thus the state space S is a convex subset ofV 
and pure states correspond to extreme points of S- (h) 
An n-valued measurement M is represented by an n-tuple 
of effects (ej)" =1 on S such that ^ = u, meaning that 
ei(s) is the probability to get ith outcome when perform- 
ing the measurement M under a state s. 

Remind the following mathematical terminologies: a 
subset W in a real vector space is called a convex subset 
iff it is closed under any convex combinations. A vector 
w G W is called an extreme point iff w has no non-trivial 
convex decompositions in W, i.e., w = pw\ + (l— p)w2 for 
some u>i, u>2 G W and p G (0, 1) implies w = w\ = W2- A 
real functional / : W — >• R is called an afhnc functional 
on W iff f{pwi + (1 -p)w 2 ) = pf{wi) + (1 -p)f{w 2 ) for 
any wi,W2 6 W and p G [0, 1]. An affine functional e on 
W is called an effect on W iff the range is in [0, 1]. The 
unit effect and the zero effect, denoted by u and 0, are 
effects defined by u(s) = l,0(s) = for all s G W. For 
the readers' convenience, we present the precise proof of 
Representation 1 in Appendix lAl fSee Theorem [5]). 

Notice that a representation of states and measure- 
ments is of course not unique. GPTs with state repre- 
sentations S C V and S' C V in real vector spaces V, V 
are equivalent (hereafter affine isomorphic) iff there ex- 
ists an affine bijection A : S — > S' such that the corre- 
spondent of measurement M = (ei)™ =1 on S is given by 
M' = (e; o A _1 )™ =1 on S' ■ In particular, if the (affine) di- 
mension of S is finite, one may represent states as vectors 
in an Euclidean space. 

In the following, we use the Representation 1 and let 
S be a state space of a vector representation as above. 
We denote by A(S) and £(S) (or simply A and £) the 



sets of all the affine functionals and effects on 5- It is 
easy to show that A{S) is a real vector space with point- 
wise sum and multiplication and £{S) is a convex subset 
of A(S)- An extreme point of £(S) is called a decision 
effect (or a pure effect) 0,[Hi. 

Notice that, while any measurement M is represented 
by an rt-tuples (ei)™ =1 of effects on S satisfying ei = u, 
the opposite might not hold in general. However, as it 
is satisfied in both classical and quantum systems (see 
Sec. |HB|) . in this paper we assume the following postu- 
late: 

[P4] Given a state space S, any mathematically well- 
defined measurement on S is feasible. Namely, for any 
n-tuples of effects (e,-)^_i with ~Y^i e i — u > there exists 
the corresponding measurement M such that M is repre- 
sented by (ei)i. 

With this condition, it turns out that only one has 
to specify is the state space of GPT, since then feasible 
measurements are automatically given by effects on S- 

In this paper, we assume [P1-P4] as basic principles for 
operationally well-defined probabilistic models. 

Next, we explain that a natural topology is introduced 
into a state space. Note that any physical measurements 
are accompanied by a finite (even small) error. Thus, it 
is operationally natural to say that states s\ and s 2 are 
close iff for arbitrary but finite numbers of measurements 
the probabilities by measuring them under s\ and S2 are 
within an arbitrary small error. This defines the so-called 
physical topology [13j . With this natural topology, we can 
go further from the Representation 1: 

[Representation 2] A state is represented by a vector 
in a locally convex Hausdorff topological vector space V 
such that the topology is given by the physical topology 
and — without loss of generality — the state space is a 
compact convex subset ofV. In particular, i/dim( l S) =: 
d < 00, a state is represented by a vector in M. d such that 
the physical topology is the Euclidean topology and S is a 
compact (i.e., closed and bounded) convex subset in M. d - 

Note that dim S is the dimension of the affine hull 

Aff G£) : = IE* \ s i I Wi G S, Xi G M, J2i = !}■ Wc re- 
fer [22| for a (locally convex Hausdorff) topological vec- 
tor space V. Remind that if V is finite dimensional, 
then the topology is unique. Thus, if V = M d , then 
the unique topology is given by the Euclidean topology 
of M. d - Any non-empty compact convex subset W of V 
has the non-empty set of extreme points of W and is the 
closed convex hull of extreme points (Krein-Milman The- 
orem). In finite dimensional case, any vector w G W has 
a convex decomposition with finite numbers of extreme 
points: w = PiW; with extreme points {uii}i and 

a probability distribution (pi)ti (see, for instance, The- 
orem 5.6 in [23|]). Therefore, in any GPTs, there exist 
enough pure states so that S is a closed convex hull of 
Spure- Physically speaking, any state s is approximately 
(in the sense of the physical topology) prepared with a 
probabilistic mixture of finite numbers of pure states. If 
dim Aff( l S) < 00, any state s is exactly prepared with 
a probabilistic mixture of finite numbers of pure states 
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(hereafter, a pure-state decomposition). For the readers' 
convenience, Representation 2 is explained in details in 
the case of finite dimension in Appendix |B] Indeed, one 
can introduce a natural norm on V and all the topological 
issues can be described with this norm in the finite di- 
mensional case. (See [l3| and [lH for infinite dimensional 
cases.) 

In the following, we restrict ourselves to finite dimen- 
sional GPTs where the state space S is embedded into 
a finite dimensional vector space V. For the underlying 
vector space, we assume without loss of generality V := 
AfF(S). In particular, one can assume S to be a compact 
(i.e., closed and bounded) convex subset in a Euclidean 
space R d noting that the Euclidean topology coincides 
with the physical topology. 

Notice that, for any state s, one can define an affinc 
functional A s : £ -> [0, 1] by 

A s (e) := e(s) (Ve £ £), 

which satisfies A s (u) = 1 and A s (0) = 0. From [P2-1], 
we have A Sl = A S2 iff si = Therefore, a state is rep- 
resented by an affine functional A : £ — > [0, 1] satisfying 
A(u) — 1 and A(0) = 0. (Indeed, in Appendix 1X1 a vec- 
tor representation of states is introduced in this way.) It 
is interesting to notice that the converse is also true: 

Theorem 1 Let A be an affine functional on £ = £(S) 
such that A(e) £ [0, 1] for any e £ £ and A(u) = 1, A(0) = 
0. Then, there exists the unique state s £ S such that 
A(e) = e(s) for any e £ £. 

(The proof of this theorem is given in Appendix |P| ) 
Therefore, one can identify states and affine functionals 
on £ satisfying the above properties. In the following, we 
omit the symbol A s and just write 

s(e) = e(s), (2) 

meaning that s(e) := A s (e). 

To complete the framework of GPTs, one still needs 
another important notions of transformation (dynamics) , 
measurement process [24[, and especially composition of 
systems. In this paper, we consider GPTs only from 
the viewpoints of states and measurements by consid- 
ering how physical principles posed only on states and 
measurements can narrow down GPTs. This means, of 
course, that there remains a freedom to further specify 
GPTs by using another principles on dynamcis, compo- 
sition of systems, etc. f8l-fl0l|. 

B. Examples of GPTs 

The typical examples of GPTs are Classical Probabil- 
ity Theory and QM. Here, we briefly review the finite 
cases for classical theory and QM: 

[Example 1] (Finite Classical Probability Theory) A 
state of a finite classical probability theory is described 
by a probability distribution p = (pi, . . . ,p c ) on a sample 



space r^ c = {uji, 0J2, ■ • ■ , w c }, and the state space is given 
by Sd ■= {p = (jH,...,Pc) I Pi > 0,Eift = 1} C R c . 
There exist c numbers of pure states (vertices): p^ = 
(1,0, ... , 0), . . . ,pW = (0, . . . , 0, 1). Geometrically Sd is 
the c — 1 dimensional (standard) simplex. [39| . 

Any classical state p = (pi,...,p c ) £ Sd has the 
unique decomposition into pure states: p — PjP^' > 

and it is convenient to represent an effect e by c di- 
mensional vector (xi, . . . ,x c ) £ [0,l] c by Xi := e(p^) 
with e(p) = J2jPj x j- We write e ~ (xx, ■ ■ ■ , x c ) in 
this case. Note that u ~ (1, . . . , 1) and ~ (0, . . . , 0). 
It is easy to see that £{Sd) ='■ £d — [0, l] c , so 
there are 2 C decision effects (i.e., extreme points of £ c {): 
e *it2-»c ~ (ii,i 2 , ■ • ■ ,ic) where ii,...,i c = 0,1. Any 
measurement M = (e^)" =1 is represented by n vectors 

e; ~ {x[ l \...,xf) £ [0,l] c such that ^xf = 1. 
Typical measurement is given by that of random vari- 
able / : Q c — > {mi, ...,m n } where Pr{/ = m,i\p\ := 
E r , m ^fU)Pj = Ei=i^m»/(j)Pi- Since (S mifU) ) c j=1 is 
one of decision effects, a measurement of random vari- 
able corresponds to a measurement with respect to deci- 
sion effects. 

Note that any c-simplex is affine isomorphic to the c- 
dimensional standard simplex. Therefore, any GPT with 
simplex-state space is a classical system. 

[Example 2] (Finite Quantum Mechanics) A state of a 
finite QM is represented by a density operator p (a pos- 
itive operator with unit trace, denoted as p > and 
tr p = 1) on a c dimensional Hilbert space % c . The 
state space is thus given by S q '■= {p £ C(Hc) | P > 
0,trp = 1} C C(Hc)h, where C(Uc) (C(Hc)h) is the 
set of all the linear (Hermitian) operators on % c . Pure 
states correspond to the one dimensional projection op- 
erators, i.e., p = \4>){il)\ with unit vector ip £ "% c . A 
quantum effect e is usually represented by a positive op- 
erator E £ C(Hc) such that e(p) = tr(Ep) given by 
E '■= Yn,j=i witn an arbitrary or- 

thonormal basis {^j}? =1 of % Cl where e is a linear ex- 
tension of e to C(Hc) (Sec Appendix [Cj) . Note that the 
unit and zero effects u, correspond to the unit and zero 
operators I and 0, respectively. It is easy to see that 
£ q := £(S q ) =i £(Hc) P ov m ■= {E £ £(Hc)\0 < E < l}, 
the element of which is called a POVM (positive oper- 
ator valued measure) clement. Thus, any measurement 
AI = (ei)2 =1 is represented by n-tuple of POVM ele- 
ments e l ~ Ei £ C(7ic)povm such that Ya=i E i = I) 
which is called a (discrete) POVM. The decision effects 
correspond to projection operators. 

The most elementary quantum system is a qubit sys- 
tem with 2 dimensional Hilbert space %2- For a qubit 
system, the Bloch vector b = (bi,b 2 , b 3 ) £ M 3 , defined by 
S q ^ p i-> bi — tr((T,p) (i = 1,2,3), gives a useful state 
representation. (Here, <Xj (i = 1,2,3) are Pauli spin ma- 
trices). Remind that the state space is then given by a 
unit ball B := {b £ R 3 \ YV 6? = 1} C E 3 H]. 

Notice again that both classical and quantum theories 
satisfy [P4], i.e., any mathematically well-defined mea- 
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surcment corresponds to a feasible measurement by ad- 
mitting the so-called indirect measurement [40 ] . 



III. ADDITIONAL PRINCIPLES 

In the previous section, GPTs are explained as opera- 
tionally well-defined probability models based on natural 
principles [P1]-[P4]. In this section, we further consider 
additional two principles of [P5] physical equivalence of 
pure states and [P6] decomposability with distinguishable 
pure states. 



A. Physical Equivalence of Pure States 

Intuitively, it seems natural to assume that all the pure 
states are physically equivalent, i.e., there are no phys- 
ically exceptional pure states in nature due to its sym- 
metry. This would be acceptable if one reminds both 
classical and quantum cases, where in both cases there 
are no structural differences in the set of pure states. 
(For instance, consider a one dimensional classical par- 
ticle, where the set of pure states is the phase space K 2 
of the position and the momentum. Obviously, there are 
no exceptional points in R 2 . Also in quantum systems, 
there are no exceptional vectors (pure states) in Hilbcrt 
space.) As this argument yet resorts to a rough intuition, 
we consider here what we exactly mean by the physically 
equivalence of states with an operational view point of 
measurements. 

First of all, we would like to express that states s\, S2 £ 
S are physically equivalent if there are no different phys- 
ical structures on s\ and s 2 through measurements. This 
would be naturally characterized by saying the following: 
For any measurement E = (ei)™ =1 , there uniquely exists 
the corresponding measurement F = (/i)™ =1 such that 
the probability distributions on si by E and S2 by F arc 
the same: ei(s 1 ) = /i(s 2 ); Next, this correspondence of 
the measurements should preserve the affine structures 
for a convex combination of measurements. Indeed, one 
can perform a convex combination of measurements by 
a probabilistic mixture of each measurement. Therefore, 
the corresponding measurement of a convex combination 
of E and E 1 should also be performed by the probabilistic 
mixture of F and F' (the corresponding measurements 
of E and E 1 ) with the same weight. Finally, the corre- 
spondent of the trivial measurement (u) (i.e., 1-valued 
measurement) should be again the trivial measurement 
(u). From these considerations (in particular for yes- 
no measurements), we can give the formal definition of 
state-equivalence as follows: 

Definition 1 (Physical Equivalence of States) We say 
that state si is physically equivalent to state S2 if there 
exists a unit-preserving affine bijection <fr : £ ^ £ such 
that e(si) = tf>(e)(s2) for any e £ £ . 



We denote by s\ ~ S2 iff s± is physically equivalent to 
S2 ■ It is easy to see that ~ is an equivalence relation: (i) 
s ~ s, (ii) si ~ s 2 => s 2 ^ si and (iii) si ~ s 2 , s 2 ~ s 3 => 
si ~ S3 for any s,Si,s 2 ,S3 £ S- (Use identity map on 
£, an inverse map and composition of maps on £ as the 
above map <&): 

Proposition 1 Physical equivalence ~ is an equivalence 
relation. 

Based on this operationally motivated definition, we con- 
sider the following principle: 

[P5] (Physical Equivalence of Pure States) Any pure 
states si and S2 are physically equivalent. 

The following result gives an equivalent characteriza- 
tion of the physical equivalence in terms of the structure 
of state space: 

Theorem 2 States s\ and S2 are physically equivalent 
iff there exists an affine bijection ^ : S — > S such that 
si = *(s 2 ). 

To prove this theorem, we start from the following fact 
on effects: The set of effect £ = £(S) has a natural order 
relation defined by 

e > / & e(s) > f(s) Vs £ S 

for effects e, / £ £. Note that u and are maximum and 
minimum effects. 

Lemma 1 Let $ : £ — > £ be an affine bijection. Then 
the following three conditions are equivalent: 

1. $(0) = 

2. Ife,fe£ and e < f, then $(e) < $(/) 

3. $(it) = u 

(The proof is given in Appendix [Dj This lemma is used 
just for justifying that we have assumed in Definition [T] 
only the latter one of the two conditions $(0) = and 
<f>(u) = u.) 

Now we introduce the dual maps of affine bijections on 
£ and S, respectively. 

First, let $:£—!- £ be an affine bijection on £ such 
that <&(m) = u (and thus $(0) = from Lemma[T|). Then, 
the dual map <&* : S —> S on S is defined by 

$*(s)(e) := s($(e)) (Vs GS,e£ £). (3) 

(See Eq. ^ for the notation.) We notice that Eq. d3j> 
for fixed s £ S defines an affine functional on £ satisfy- 
ing conditions in Theorem[T] To see this, let A := $*(s). 
For any A e [0, 1], ei, e 2 £ £, we have A(Aei + (l — A)e 2 ) = 
s($(Aei + (1 - A)e 2 )) = s(A$(d) + (1 - A)$(e 2 )) = 
As($(ei)) + (1-A)s($(e 2 )) = AA(ei) + (1 — A)A(e 2 ), and 
thus A is an affine functional on £ . Since s(e) £ [0, 1] for 
any e £ £, the range of A is in [0, 1]. Finally, A(it) = 
s($(u)) = s(u) = 1 and A(0) = s($(0)) = s(0) = 0. 
Thus <f>* is a well-defined map on S from Theorem [1] 

Notice that, for any A £ [0,l],si,s 2 £ S, one has 
$*(Asi + (1 - A)s 2 )(e) = (Asx + (1 - A)s 2 )$(e) = 



6 



(A$*(si) + (1 - A)$*(s 2 ))(e) (Ve G £), hence $* is affine 
on S- Let $*(si) = §*(s 2 ) for some states s lf s 2 G 5- 
Then, since / := $ _1 (e) G £ for any e G £, we have 
Sl (e) = sr^^-He))) = = **(»2)(/) = 

s 2 (<&($ 1 (e))) = s 2 (e). Therefore, from the separation 
hypothesis for states, we have si = s 2 , i.e., $* is injective 
(one-to-one). Notice that ( i> _1 is also an affine bijection 
on £ such that <f>" 1 (ti) = u and <J> _1 (0) = 0. For an 
arbitrary s£5, define i := ( ( E>~ 1 )*(s) G S, and it follows 
that for any e G £ 

d>*(t)(e)-<(<i>( e ))- S (<f>- 1 ($( e ))) = S (e), 

and again from the separation hypothesis for states, we 
have s = $*(<). Therefore, <£>* is surjective, hence an 
affine bijection on S- 

On the other hand, the dual map of the affine bijection 
^ : S — > S is easily defined by 

**(e)(s) := e(*(«)) (Ve G £, s G 5). (4) 

Since the above defined \&*(e) is obviously an affine func- 
tional on S to [0, 1], ^*(e) is an effect. It is easy to see 
that \&* is bijective on £ and satisfies ^*(u) = u (and 
**(0)=0). 

Finally, it is also easy to see that both dual maps on 
£ and S satisfy ($*)* = $ and (vf*)* = from the 
definitions. To sum up, we have obtained 

Proposition 2 (i) For any affine bijection <f> : £ — > £ 
such that <f>(it) = u (and thus <I>(0) = 0), the dual map 
$* : S — > S is well-defined by Eq. ([3]) and is affine bijec- 
tive. (ii) For any affine bijection ^ : S S, the dual 
map \&* is well-defined by Eq. ^ and is affine bijective 
satisfying ty*(u) = u (and thus V E'*(0) = 0^. (Hi) For both 
bijections, ($*)* = $ and (**)* = 

Now we can give the proof of Theorem [2j 
[Proof of Theorem [2] Let si,s 2 be physically equiva- 
lent, i.e., there exists a bijective affine $ : £ — > £ such 
that e(si) = $(e)(s 2 ) for any e € £ and $(u) = m (and 
thus $(0) = from Lemma[T|). From Proposition [5] (i), 
v]> := $* gives an affine bijection on S- From Eq. ©, we 
have *(s 2 )(e) = s 2 ($(e)) = $(e)(s 2 ) = e(si) = Si(e) for 
any e G £, and thus si = v I/(s 2 ). 

Conversely, assume that there exists an affine bijection 
^ ■ S — > S such that si = ^>(s 2 ). Then, from Proposition 
[2] (ii), the dual map <J> := ^P* is an affine bijection on £ 
satisfying that $(e)(s 2 ) = #*(e)(s 2 ) = e(*(s 2 )) = e(si) 
and <f>(u) = u. Therefore, states Si,s 2 are physically 
equivalent. ■ 

From Theorem [2l [P5] is equivalcntly rephrased as fol- 
lows: 

[P5']For any given pure states si,s 2 G S, there exists 
a bijective affine map ^ on S such that s 2 = ^(si). 

In [l9| , we introduced the principle of equality of pure 
states intuitively meaning [P5] but have used [P5'] as its 
formal definition. Theorem [5] establishes its operational 
meaning as an equivalence of pure states. Some authors 



[8l4icj| also have used [P5'] meaning that there are re- 
versible transformations (dynamics) which connects any 
pure states. Thus, we have shown the equivalence of two 
superficially different physical principles. 

On the other hand, from the mathematical and phys- 
ical interests, Davies [26| have investigated the following 
condition: 

[P5"] The group of affine bijections on S acts transi- 
tively on the set of extreme points of S- 

It turns out that [P5], [P5'], and [P5"] are all equiva- 
lent: To see this, let Ab{S,S) be the set of affine bijec- 
tions on S- One can show that Ab(S, S) forms a compact 
group which acts continuously on S [2a|- Note that an 
affine bijection on S maps a pure state to a pure state 
[4l| . Therefore, it is straightforward that [P5'] holds 
iff affine bijections on S acts transitively on Spure, i-e., 
Spure = {^(s) I G Ab(S,S)} with arbitrary reference 
pure state s G Spure- 

In the following, we prefer to interpret the above equiv- 
alent conditions physical equivalence among any 
pure states [P5]. Indeed, as is the case for any basic 
physical theories so far, a symmetry is one of the key 
factor to understand our world, and [P5] enables us to 
interpret, without appealing to dynamics, that there ex- 
ists an operationally motivated symmetry in the set of 
pure states. Of course, this is a matter of taste, and the 
following argument follows for those who prefer [P5'] or 
[P5"]. However, following Davies, we call GPTs satisfy- 
ing [P5] (or equivalently [P5'] or [P5"]) symmetric GPTs. 



1. Examples of Symmetric GPTs 

Both classical theory and QM are symmetric GPTs. 
Indeed, for any pure states p^\p^ G Sci with some 
fi, v = l,...,d, let VP be a map on Sci defined by 

[*(p)k = P*,[*Cp)]»< = Pi* an d [*(p)]i = Pi 
/i, v). Obviously \& is an affine bijection on Sci satisfying 
^(p M ) = p v . In quantum system, for any pure states 
pi := \ipi){i/j 1 \ i p2 := l^a)^! € S q , one can easily find a 
unitary operator U : %d Hd such that U4>2 = 4>i EH • 
Then it is easy to see that *B(p) := UpU^ is an affine 
bijection on Sq satisfying pi = ^(p 2 ). 

Another sim ple example of symmetric GPTs are 
cuboid systems [271 ] : 

[Example 3] (Cuboid Systems) We call GPTs cuboid sys- 
tems if the state spaces are represented by hypcrcubes: 
Scube ■= {x G R d | < x, < 1 (i = l,...,d)}. 
There are 2 d pure states (vertices): x^ ll ' l2 ''"' li ^ := 
(h,i2, •■■,id) (h, ■ ■ ■ ,id = 0, 1). 

It is easy to see that the cuboid system Scube is also 
symmetric. In general, any isogonal figure is symmetric 
from the definition: 

[Example 4] (Systems with Isogonal Figures) A polytope 
in Euclidean space is said to be isogonal iff for any ver- 
tices there exists an isometrical map which surjectively 
connects them [28| . (Notice that any isometrical map on 
Enclidcan space is automatically affine.) Therefore, any 
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GPTs with state space being isogonal figure are symmet- 
ric. 



with polytope state space, where the set of pure states is 
{si, . . . , s n }, we have 



2. Invariant States 



1 

sm = - 



(5) 



In this section, we discuss an invariant state (the max- 
imal mixed state) in GPTs. 

In general GPTs, the existence of an invariant state 
with respect to all the bijective affine maps on S can be 
shown based on Ryll-Rardzcwski fixed point theorem. 

Theorem 3 For any GPT, there exists an invariant 
state Si nv G S with respect to all the bijective affine maps 
on S- 

[Proof] We refer to [H, H3l for Ryll-Rardzewski fixed 
point theorem. Using the norm in Appendix [B] the un- 
derlying vector space V of S is a Banach space w.r.t. 
the sup-norm and S is a compact convex subset of V. 
Since we assumed dim V < oo, all topologies in the Ryll- 
Nardzewski theorem are equivalent to the norm topology. 
Let G = Ab(S,S) be the set of all the bijective affine 
maps on S- Then, it is easy to show that G is a semigroup 
of (weakly) continuous affine maps of S- In particular, 
G is noncontractive on S, since the norm || • || satisfies 
inf« gG ||*(«i) - $(s 2 )|| = - « 2 || > (Vsi / S2 £5) 
from Lemma 0] in Appendix [B] Applying the Ryll- 
Nardzewski theorem, the assertion of the theorem holds. 
■ 

In general GPT, an invariant state is not necessarily 
unique. (For instance, consider a GPT with the state 
space being a circular sector S = {{rcosO, rsin#)|0 < 
J"<1,O<0<S}. Then, there are only two affine bijec- 
tions on S'- the identity map and the reflection map with 
respect to the 45 degree axis. Then, all the states on the 
axis are invariant under these maps.) However, in a sym- 
metric GPT, the unique invariant state is determined. 

Theorem 4 In a symmetric GPT, an invariant state is 
unique. 

[Proof] Assume that there exist distinct invariant states 
S\,S2 on S- Then, any state of the form ps\ + (1 — p)s2 G 
S for any p G R is invariant under any bijective affine 
^ : S — > S (See Lemma [S] in Appendix [C]) . In particular, 
there exists such state S3 on a boundary of S- Let H C V 
be the supporting hyperplane of S on S3 and let S3 = 
Y17=oPrti (P» > 0) ( n G N) De a convex decomposition of 
S3 with respect to pure states {ti}. Then, since S3 G H 
and if is a supporting hyperplane of S, we have t{ G HHS 
for every ti. Notice that there exists a pure state sq G S 
not in H . (Otherwise, S C H.) Since S is symmetric, 
there exists an affine bijective map VP which connects to 
and s : s = f(t ). However, we have Y^^PiU = s 3 = 
/( s 3) = Y,i=oPif(ti) = VqSq +Pif(h)---, and this is 
contradictory. ■ 
In the following, sj\/ denotes an invariant state, which 
is uniquely determined for symmetric GPTs. For a GPT 



Indeed, as any bijective affine $ on S maps any pure state 
to a pure state, $ just permutes finite numbers of pure 
states, and we have $(sm) = \ ^K s *) = s ^i- ^ n 
particular, in classical system, sm is the uniform distri- 
bution on Q: sm = (l/d, ■ ■ ■ , 1/^)- For cuboid systems, 
we have sm = (1/2, 1/2, ... , 1/2), i.e., the center of Scube- 
In the case of finite quantum systems, notice that bijec- 
tive affine map on Sq is unitary map or anti-unitary map 
on the density operators |3lj . Therefore, the maximally 
mixed state 
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sm 



is the unique invariant state. 

In general, the construction [26[ of sm is given by the 
two-sided Haar measure [Hj]: Using the two-sided Haar 
measure on the set G of all the affine bijection on S, we 
have 1 



sm 



*(*o)dM*), 



(6) 



where s G S is an arbitrary reference state (See also 
[H[ for the mathematical details). Indeed, the unique 
existence of the invariant states of Symmetric GPTs is 
first shown by Davies [26[ by means of the Haar measure. 

Note that an operationally natural entropy in general 
GPTs is defined by 



S(s) := 



M-. 



inf 

= ( m j)ieMi, 



H ( m i(s)), 



(7) 



where H(j3j) 



Y^iPj]°&2Pj i s the Shannon entropy 



and the infimum is taken over the set Mi nc i of all the 
indecomposable measurements (l9l . |3~H . S is concave on 
(S, generalizing Shannon and von Neumann entropies in 
classical and quantum systems, respectively. In a sym- 
metric GPT, S takes minimum value on pure states, 
and thus S provides a proper measure of mixedness . 
Therefore, the following proposition implies that we may 
call sm the maximal mixed state in general (symmetric) 
GPT. 

Proposition 3 In any GPT, the entropy S takes the 
maximal value on an invariant state. 

[Proof] Notice that for a bijective affine f on 5 and 
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indecomposable measurement M 

an indecomposable measurement [19(. Thus, we have 
S(^(s)) = S(s) for any state s. Then, from the concav- 
ity of S, for any state s, the invariant state Sm obtained 
by ([6]) from the reference state sq := s satisfies 



s(s M )> / s(*(s)HiW = 

Jg 



S{s)dfi($) = S(s) 
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The next result shows a global picture of a state space 
S in symmetric GPT. Let D be a distance function which 
have the monotonicity properties under affine bijection 
on S, i-e., for any states S\,S2 G S and any afhne bijec- 
tion : S -> 5, £>(si,s 2 ) > D(*(si),*(s 2 )). The typ- 
ical example is given by the Kolmogorov distance (See 
Appendix [Bj . 

Proposition 4 In symmetric GPTs, for any distance 
function D on S with the monotonicity property, the dis- 
tances between sm and arbitrary pure states are constant. 
Namely, for any pure states s lf S2, 

D(s u sm)=D(s 2 ,sm) (8) 

[Proof] From the symmetry of S, there exists an afhne 
bijection VP on S such that si = ^(s2). From the mono- 
tonicity of D and the invariance of sm, we have 

D( Sl , s M ) = D{«£{s 2 ), < D(s 2 ,s M ). 

The opposite inequality follows by a symmetric argu- 
ment. I 
Geometrically speaking, all the pure states of symmet- 
ric GPTs lie on the sphere with respect to the distance 
with the monotonicity property, e.g. Kolmogorov dis- 
tance. In the case of Kolmogorov distance (|B2|) , physical 
meaning of © is that the optimal success probability to 
distinguish between the invariant state Sm and any pure 
state s is constant on s (Sec Eq. (|B2[) in Appendix |B|) . 

3. Classification of Symmetric GPTs 

Now we proceed to the classification theorem of Sym- 
metric GPTs. First, remind that without loss of gener- 
ality, S is a compact subset of R d , and by the parallel 
shift, we can assume the invariant state sm of G to be 
the origin of Mo- 
using the Haar measure on the set of G of all the afhne 
bijection on S, one can define a G-invariant inner prod- 
uct by (x, y)a '■= Jq(^ x , ^v)d^{^!) where the right-hand 
side is the Haar integral and (x, y) denotes the Euclidean 
inner product of x, y G M. d - Then, using an orthonor- 
mal basis w.r.t. this G-invariant basis, we get to a new 
state-representation in M. d such that $eG acts isomet- 
rically on states. Remind that, in Sec. IIII A If Example 
4), we have just illustrated symmetric GPTs with state 
spaces being isogonal figures, but now it earns the gen- 
eral symmetric GPTs if the numbers of pure states are 
finite: 

Proposition 5 The state space S of a symmetric GPT 
is either one of isogonal figures or with infinite pure 
states. 

In [33[ , we further obtain the following classification the- 
orems of symmetric GPTs in 2 and 3 dimensional cases: 



Theorem 5 The state space S of symmetric GPTs with 
dim(Aff S) = 2 is either an isogonal figure or a unit disk. 

Theorem 6 The state space S of symmetric GPTs with 
dim(Aff S) = 3 is any one of an isogonal figure, a circular 
cylinder, or a unit disk (ball). 

B. Decomposability of states with distinguishable 
pure states 

In this section, we consider another principle of the 
decomposability of states with respect to distinguishable 
pure states. 

Remind that states {si, . . . , s n } are said to be distin- 
guishable if they can be distinguishable with probability 
1 in a single shot measurement; namely if there exists a 
measurement M = (e,)^_ 1 such that Ci(sj) = 5ij for any 
i,j = 1, . . . , n. The maximum number of distinguishable 
states is an important parameter to characterize GPTs, 
and is denoted as c. In classical systems, c is the number 
(cardinality) of the sample space fi c , while in quantum 
systems it is the dimension of the corresponding Hilbert 
space T-t c . In general GPT, c has the following upper 
bound: 

Proposition 6 In GPT with state space S, we have 

c < dim(5) + 1. (9) 

The eguality holds iff S is a simplex. 

To see this, observe the following lemma: 

Lemma 2 A distinguishable set of states {si, . . . , s n } is 
affinely independent. Therefore, Conv{s;} is an (n — I) 
dimensional simplex. 

[Proof] Let ^27=1 P^ s i = S™=i Qi s i ^ e convex combina- 
tions of {si}i w.r.t. probability distributions (pi)i and 
(qi)i- Then, applying a measurement M = (ei)i which 
distinguishes {si}i, we get pi = qi, and thus {s;}; is 
affinely independent (See [39]). ■ 
[Proof of Proposition [6] Let {si, . . . , s c } be a set of 
distinguishable states. Then, from Lemma [5J Conv{sj} 
is a (c — 1) dimensional simplex. As Convjsi} C S, we 
have c — 1 < dim(,S). 

Let the equality of Eq. © holds. Let d := dim(<S) and 
let {si, . . . , Sd+i} be distinguishable states with a mea- 
surement M = {m,j) d ~^\ which distinguishes them. Note 
that, from Lemma [2 Conv {si}^ 1 is a d-dimensional 
simplex. We show that S = Convjsi}^ 1 . To see 
this, notice that the dimension of Aff {sijfij 1 is d and 
{si} d =i C 5, and thus Aff {sOf=i = Aff <S- Therefore, 
any s G S can be written as s = X)f=i ^i s i with some 
A; G M satisfying = 1- Applying m i to s (See 

Lemma [5] in Appendix |C|) . we obtain A; = mi(s) > 0, 
and thus s G Convjs;}^ 1 . Therefore, we have S C 
Conv{si} d ^ . As si G S, we obtain S = Conv{s l }^t 1 1 . 
Namely, S is d dimensional simplex. ■ 
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In both classical and quantum systems, any state is 
written as a convex combination of distinguishable pure 
states. (Note that any classical state p — (pi, ■ ■ ■ ,Pd) has 
the decomposition p = 5Zu=i P^P^i while any quantum 
state p has a decomposition p = ~YT i= iPi\il)i)(ipi\ as an 
eigenvalue decomposition of p.) This property can be 
physically phrased as following physical principle: 

[P6] (Decomposability with Distinguishable Pure 
States) Any state can be prepared as a probabilistic mix- 
ture of distinguishable pure states. 

It is interesting to consider GPTs satisfying this prin- 
ciple, which include both classical and quantum systems. 
We notice that this property is derived from their axioms 
in (U. 

The following result provides the general classification 
of such GPTs: 

Theorem 7 GPTs satisfying [P6] is either classical sys- 
tem or systems with infinite pure states. 

To prove this, we use the following lemma: 

Lemma 3 If S is a d dimensional polytope satisfying the 
principle [P6], then there exist d+1 distinguishable states. 

[Proof] The case d = is trivial. Let d > 1. Let T be 
the family of all subsets of S such that it is the convex 
hull of at most d pure states of S- Since S is a polytope, 
the number of pure states is finite, therefore the size of 
T is also finite. Now each subset of S in T is at most 
(d— l)-dimcnsional, therefore its e?-dimcnsional volume is 

0. Hence the union (J J 7 of all (finitely many) members of 
T also has d-dimensional volume 0, while S has positive 
c?-dimensional volume. This implies that S contains a 
state s that does not belong to (J T . By the principle of 
[P6], s is a convex combination of distinguishable pure 
states si, . . . , Sfc, and now k > d by the definition of T 
and s. Hence the claim holds. ■ 
[Proof of Theorem [7] Let the number of pure states be 
finite, i.e., S be a polytope. From Lemma [3l there exist 
d+1 distinguishable states. Therefore, the equality of 
Eq. ^ holds and thus from Proposition El 5 is simplex, 

1. e., a state space with a classical system. ■ 

C. Classification of GPTs with [P5] and [P6] 

Combining the classifications of GPTs with [P5] and 
[P6], i.e., Thcorcm[SJ Theorem [5] for [P5] and Thcorem[7] 
for [P6], we obtain the following: 

Theorem 8 The state space S of GPTs satisfying [P5] 
and [P6] with dim S = 2, 3 is either simplex or unit disk. 

This result implies that GPTs with principles [P5] and 
[P6] such that the state space is embedded in 2 or 3 di- 
mensional real vector space is restricted to either classical 
system or quantum system (qubit). 

Notice that this does not hold in higher dimensional 
GPTs as quantum systems with more than or equal to 3 
level systems satisfy both [P5] and [P6] while the state 
space is neither the simplex nor the ball (35j . 



IV. CONCLUSION AND DISCUSSION 

In this paper, after confirming our basis of general 
GPTs (namely, Principles [P1]-[P4]), we have introduced 
and investigated two physical principles [P5] (Physical 
Equivalence of Pure States) and [P6] (Decomposability 
with Distinguishable Pure States). In particular, the for- 
mer is established with the operationally motivated def- 
inition of physical equivalence of states, which turns out 
to be equivalent to the symmetric structure of the state 
space (See Sec. IIII A|) . In each principle, we gave a clas- 
sification of state spaces (Theorems [5l |5] for [P5] and [7]) 
for [P6]) and by combining them, we showed that GPTs 
with 2 or 3 dimensional state spaces are either classical 
or quantum system (with the Bloch ball) (Theorem [5]) . 
Note that similar characterizations of the Bloch ball are 
derived in l^-tllj. However, the definition of the Bloch 
ball there is the state space of systems with at most two 
distinguishable states, while our derivation does not as- 
sume the maximum number of distinguishable states. In 
this paper, we focused on simple principles on states and 
measurements, and showed how each principle narrows 
down GPTs close to classical and quantum systems. In 
near future, we further consider other principles on espe- 
cially correlations on composite systems, and discuss the 
characterization of QM by purely physical principles. 
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Appendix A 

In this appendix, we give a proof of Representation 1 
which is formally described by the following Theorem: 

Theorem 9 Assume [P1]-[P3] and let S be the state 
space. Then, there exist a real vector space V and a rep- 
resentation map : S ~> V such that (i) the map ~ is in- 
fective (one-to-one) and {p; si, S2) = psi + (l — p)s%; (thus 
S is a convex subset of V and pure states correspond to 
extreme points of S), and (ii) an n-valued measurement 
M is represented by an n-tuple of effects (e^)" =1 on S 
such that Pr{M = i\s} = ei(s) for any s G S (and thus 
~Ylii e i = u where u is the unit effect). 

[Proof] We call a real functional / : S — > R an "affine" 
functional iff /((p; s 1: s 2 )) = pf(si) + (l-p)f(s 2 ) for any 
si, S2 & S and p € [0, 1]. Also an "affine" functional e on 
S is called an "effect" if the range is in [0,1]. The unit 
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"effect" u is the effect such that u(s) = 1 for any s E S- 
We denote by A(S) and £(S) the set of all the "affine" 
functionals and "effects" on S- It is easy to see that 
A{S) forms a real vector space with pointwise addition 
and scalar multiplication, while £(S) is a convex subset 
of A(S). 

Let M be an n-valued measurement. Then, the prob- 
ability Pr{M = i\s} with a fixed i is considered as a map 
from S to [0, 1]: 

seS-> e;(s) := Pr{M = i|s} (i = 1, . . . , n), (Al) 

which are clearly "effects" from [P3-1]. Remind that 
measurements are characterized only through probabil- 
ities from [P2-2]. Thus M is characterized by n-tuple 
of "effects" (e,)r=i where e t E £(S) is defined by ([AT]) . 
From the normalization condition of the probability, we 
have ei = u. In the following, we denote by 

M = (ei)f =1 an n-valued measurement meaning that 
ei(s) is the probability to get ith outcome when perform- 
ing M under state s. Note that the part (ii) of Theorem 
[9] is an easy consequence of this representation of mea- 
surements. (See the last part of this proof.) 

Next, let V be the set of all the affine functionals on 
£(S) with bounded ranges. It is easy to see that V is 
a real vector space with pointwise addition and scalar 
multiplication. Now we define a state-representation map 
" : S -> V by 

8(e) := e(s) (Ve E £ (S)). (A2) 

Note that s is an affine functional on £(S) by the defini- 
tion, while the range of s is bounded since < e(s) < 1, 
thus * maps any state to a vector in V. Let si = s 2 for 
si,s 2 £ <S- Then e(si) = s"i(e) = 82(e) = e(s 2 ) for any 
"effect" e. Therefore, for any measurement M = (ej)™ =1 , 
Pr{M = rrii\si} = e^si) = e !; (s 2 ) = Pr{Af = m;|s 2 }, 
which implies s± = s 2 from [P2-1]. Thus, " is in- 

jective. From the "affinity of effect", (p;si,s 2 )(e) = 
e((p;s 1 ,s 2 )) = pe(s 1 ) + (l-p)e(s 2 ) = (ps 1 + (l-p)s 2 )(e) 
for any e E £(<S)- Thus, the state s = (p;si,s 2 ) is 
mapped to be a convex combination of si and s* 2 : 

(p; si, s 2 ) = ps"i + (1 - p)s 2 . (A3) 

Notice that from [P3-1] the set 5 := {s E V|s £ S}, 
which is the state-space of this representation, is a convex 
subset of V. It is straightforward from the definition that 
pure states are mapped to extreme points of S, and vice 
versa. This completes the proof of part (i) of Theorem [HI 
Finally, let "be a representation map on £(S) to the set 
of affine functionals on S defined by e(s) := e(s) (Vs £ <S). 
It is easy to see the injectivity and the affinity of ". Since 
the range of e for e £ £ is in [0, 1], e is an effect on 5. 
Therefore, any n-valued measurement M is represented 
by an n-tuple of effects (ej)" =1 on S such that J2i e "« i s 
the unit effect and Pr{M = i\s} = e~i(s) for any s £ S- 

■ 

The representation symbols * and " are all omitted 
throughout this paper except in the above proof. 



Appendix B: "Physical Topology" on State Space 

In this appendix, we review a physical topology and 
show Representation 2 in finite dimensional cases. For 
the readers' convenience and mathematical simplicity, we 
explain these using a Kolmogorov distance. 

Remind from Appendix [X] that each vector in V is a 
bounded affine functional on £(S)- Then, one can intro- 
duce a natural norm on V by 

\\v\\ := sup|w(e)|. (Bl) 

ees 

(Notice that \\v\\ < 00 as v is a bounded functional on 
£; it is clear that (i) ||u|| > 0; (ii) ||u|| = iff v = 0; 
(iii) ||cw|| = |a|||u||; (iv) ||i> + w|| < |w|| + ||w|| for all 
v, w £ V and a £ R.) Notice that the norm of any state 
s is 1 as e(s)| < 1 for all e £ £ and u(s) = 1. Thus, S 
lies on the unit sphere with respect to this norm. The 
induced metric 

Dkoi(v,w) := \\v - w\\ (v,w £ V) 

is called the Kolmogorov distance [36|, since it is a 
straightforward generalization of the Kolmogorov dis- 
tance in classical systems. In QM, the norm and distance 
are the trace norm and trace distance. The Kolmogorov 
distance on S has an operational meaning through the 
relation 

£> feo *(si,s 2 ) = 2P(si,s 2 )-l, (B2) 

where P(s%, s 2 ) is the optimal success probability to dis- 
tinguish states si and s 2 which are prepared with prob- 
abilities 1/2 and 1/2 (see [l9| and references therein). 
Therefore, the topology with respect to this norm has an 
operational meaning as states s\ and s 2 are close w.r.t. 
this norm iff the optimal success probability to distin- 
guish them are close to 1/2. 

From (|B2|) . state space S is bounded in V with this 
norm as probabilities are in [0,1]. In general, S might 
not be closed. However, with a physical consideration, it 
turns out that without loss of generality one can assume 
S is closed. Indeed, even if there exists an "imaginary 
state" v £ V in the closure of S which is not in S, there 
exists a state s £ S arbitrary close to v with this norm. 
Namely, for any small e > 0, there exists a state s such 
that \v(e) — s(e)\ < e for any e £ £. Thus, under the 
presence of an (arbitrary small but) finite error as always 
the cases in the real experiments, it is impossible to (even 
statistically) distinguish the "imaginary state" and state 
s through all the possible measurements. Therefore, it 
is a matter of taste to include or not every "imaginary 
states" as states. 

To sum up, one can assume that S is represented as 
a closed and bounded convex subset of a normed space 
V. Note that, if dim V < 00, compactness and closed 
boundedness arc equivalent from the Hcinc-Borcl The- 
orem for general normed space [37j]. Moreover, in that 
case, this topology is the same as the physical topology 
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[131 ] due to the uniqueness of the topology of the locally 
convex Hausdorff topological vect or sp ace. 

We note the monotonicity holds [19( for any affine map 
* on 5: 

||*(s)||<IN|. (B3) 

and thus 

||*(*i)-*(* a )||<||«i-« 2 || (V«i,« a e5). (B4) 

Applying this for an afhnc bijection on S, we get the 
following: 

Lemma 4 Any affine bijection 3/ on S preserves the 
Kolmogorov distance: 

||*(si)-*(fl 2 )|| = |K-*2||. 

Proof Apply (|B4)) for vE and VP" 1 . ■ 

Appendix C: Extension of affine maps 

Let Vi , V2 be real vector spaces and W be a convex 
subset of V\. Then, any affine map A : W — > V2 has an 
affine extension to V\. With some additional conditions, 
A is further extended to a linear map on V\. 

In this appendix, we present the proof of these facts for 
finite dimensional cases [43j |: Let dim V\ < 00. Through- 
out this section, A is an affine map from W to V2. 

Lemma 5 Let Wi £ W and A, £ 1 (i = l,...,m) wif/i 
E 4 Aj = 1. J/ 2™ 1 Aiiu 4 G W, men 

m m 

Proof. Let C+ := {i = {l,...,m}|A 4 > 0} and C_ := 
{i = {l,...,m}|Aj < 0}. Since E^Li Aj = 1; ^ follows 
that 1 + Eiec_ M = E;ec+ I AiL and thus c + 
EieC + M = : A > 1, |Ai| < A for any i = l,...,m. 
Letting w := Efci Ai^ G W, we have 

ieC- iec + 

where both right and left sides are convex combinations 
of vectors from W. From the affinity of A, we have 

1am + £ J^ak) = £ l^Afa), 

which implies (|C1|) . ■ 

Proposition 7 A /ias an affine extension A to Vi with 
the form 

A(v) = Av + b (C2) 

where A is a linear map from V\ to V2 and b G V%. If 
Aff(W) = Vi, then the extension is unique. 



Proof. Let dim AS(W) = n and dim V\ = n+m (to > 0). 
Let Wi G W (i = 0, . . . , n) be affinely independent set so 
that — i«o (i = 1, . . . , n) are linearly independent. Let 
{^{"j"™ be a basis of V\ such that m = wi — wo for 
i = 1, . . . , n. Then, define a map A : Vi — > V2 in (|C2[) 
with a linear map A : V\ — > V2 and b G V2 given by 

Av . ._ (A{ Wl )-A(w ) i = l,...,n 

v[ i = n + 1 , . . . , n + m 

b := A(wq) — Awq 

where v[ arc arbitrary vectors in V2 ■ Then, one can show 
that A is an affine extension of A. To see this, note first 
that any w G W is uniquely written by w = p XiWi 
with A, G R such that ElLo A» = 1 since {wi}™ =0 
are affinely independent and n is the dimension of W . 
Then, for any w = E£=o ^ iWi e W' ^ f°ll° ws that 
AH = (ElLi AiA^i) + A Aw + (A(too) - Aw ) = 
£?i A i (Aii;o+A(w i )-A(u;o))+Ao^u;o+(A(K;o)-^wo) = 
Ei=o A i A H) ~ A A(u; ) + (1 - A )(Aw - A(w )) + 
XqAwq + (A(wq) — Awq) = A(w), where we have used 
Lemma [5] in the final equation. Affinity of A follows from 
the form of (IC2l . 

Let AS(W) = V\. Assume that A and A' are affine ex- 
tensions of A. Noting that any vector v G V\ = Aff(W) 
is uniquely written by v = E"=o ^i w i with E™=o A» = 
1, we have A(v) = Yh=o X iH w i) = E"=o A i A (wi) = 
Y^i=o AiA'(wj) = A'(v) by the same argument as Lemma 
[5] (note that each Wi is in the domain W of A). Thus the 
uniqueness of the extension follows. ■ 

The following result gives a linear extension of A: 

Proposition 8 (i) IfOtf. Aff(W), A has always a linear 
extension to V\. In particular, if dim V\ = dim Aff(M /r ) 
+ 1, the extension is unique, (ii) If G Aff(Vl / ) ; then A 
does not necessarily have linear extensions. In particular, 
in the case G W , A has a linear extension to V\ iff 
A(0) = 0. 

Proof. Here we use the same notation as in the proof of 
Proposition [71 

(i) Let g Aff(W). Obviously, AS(W) ^ Vi and 
to > 1. Using the basis {^i}™^"™, let — w = Y^i=\ l x i v i 
with some Xi G R. Then, there exists at least one non- 
zero coefficient Xi [i = n + 1, . . . , to); Otherwise we have 

-Wo = E"=l x i( w i - w o) and tnus = E"=l X i W i + i 1 ~ 

E"=i ^) w o G Aff(W), which contradicts ^ Aff (W). 
Without loss of generality, let 7^ and choose 

<+i = ~^( A (wo) + Y2=iMM*»i) - A(»o))) and 
v[ = (i = n + 2, . . . , n + to). Then, a direct compu- 
tation shows that ^4u>o = A(iuo), i-e., 6 = 0. 

Next, we show the uniqueness of the linear extension 
for the case dim V = dim Aff(VF) + 1, i.e., m = 1. In 
this case, one can choose v n+ i := u>o in a basis {i>»}£ji • 
Indeed, if E"=i a i( w i ~ w o) + ot n +\WQ = and at least 
one on is non-zero, then a n +i ^ by linear independence 
of Wi — wo = Vi (1 < i < n), but this implies GAff(M / ), 
which is a contradiction. Therefore, {«i}" =1 with v\ = 
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Wi — wq and v n +i = wq is an independent system of 
V and thus gives its basis. Let A and A' give linear 
extensions of A. Then, using the above basis, any vector 
v G V is written as v = ^\ Xi(wi — wq) + x„ + iWq. Since 
Aw = A(w) = A'w for any w £ W, we have Av = A'v for 
any v G V. This proves the uniqueness of the extension. 

(ii) Let G W. Then b = A(0). Therefore, in this case, 
an affine map can be linearly extended iff A(0) = 0. ■ 

It is well-known that a quantum operation A on the set 
S q of density operators is uniquely extended to a linear 
map on a vector space Cij-ld) of all the linear operators 
on a Hilbert space Hd- While there are several ways to 
show this extension, it can be derived from Proposition 
E] as an easy corollary. Indeed, it is easy to check that 
Sq is a convex subset of a real vector space V\ := C{%)h 
of the set of Hermitian operators, satisfying ^ Aff(S q ) 
and dim C("H)h = dim AS(S q ) + 1. Therefore, one can 
uniquely extend A to a linear map A on C(j-i)h- Next, 
noting that any linear operator A G £{7i) is uniquely 
decomposed into the sum of real and imaginary parts, 
i.e., A = A R + iAi where A R := \{A + A t ) and Aj := 

(A — A^ ) , one can further extend A to a linear map A 

on c(H) by 1(A) := A(A R ) + iA(Ai). 

We notice that, in general GPTs, one can also treat 
operations (including dynamical map and measurement 
processes) described by its linear extension in a similar 
manner. 

In the following, we assume AS(W) = V\ = V2 and 
let A : W — > W be an affine map on W. We show that 
any surjective affine map on W is affincly extended to a 
bijective map on V := V\. 

Lemma 6 If A is surjective on W , so is the affine ex- 
tension A : V — > V on V . 

Proof As Aff(IF) = V, an arbitrary y G V is written 
as V — Y^=o^i w i w ith some A, G R and £),-Ai = 1- 
From the surjectivity of A on W, there exists w\ such 
that Wi = A(w' i ) (Vi = 0, ...,n). Therefore, we have 

v = Et AiA(^) = E« HM + b) = A(J2i Wi) + b = 

A(J2i Xitu'i). Thus, A is surjective on V. M 

Proposition 9 If A is surjective on W, then A : V — > V 
and A are bijective. 

Proof From the previous Lemma, A is surjective on V 
and so is the linear map A : V — > V which composes 
A. Remind that a linear map on a finite dimensional 
vector space is surjective iff it is injective. Therefore, 
A is bijective and so is A on Injectivity of A on W 
follows since it is a restriction of the injective map A to 
W. M 



Appendix D 

In this appendix, we give proofs of Lemma [1] and The- 
orem Q] 



[Proof of Lemma [Q (1 2) If e < /, then g := //2 
and k := f — e are clearly effects. As g = (f + 0)/2 = 
(e + k)/2 G £, wehave$(.g) = ($(/) + $(0))/2 = ($(e) + 
$(fc))/2. Since $(0) = 0, we have $(/) = $(e) + $(fc), 
and thus $(e) < $(/). (2 => 3) By the relation <^- 1 (u) < 
u and the assumption, we have u = $($~ 1 (it)) < $(u), 
therefore $(u) = u. (3 =>■ 1) For any g G £, u — g G £ 
and thus (<%) + $(u - g))/2 = (<P(u) + $(0))/2 G 
Since <f>(w) = u, we get + $(u - 5) - $(0) = u > 
<i>(u - g), therefore &(g) > $(0). Put g = $ _1 (0), we 
have > $(0), and thus = $(0). ■ 
Let S CR d with Aff (S) = R d , then from Proposition 
[3 any effect e G f is uniquely represented by a G R d , 6 G 
M such that 

e(s) = (a, s) + b, 

where (•, ■) represents the Euclidean inner product in R". 
(Remind that any real linear functional A : M. d — > M. 
is uniquely represented by a vector a G R™ such that 
Ax = (a,x) for any x G R d .) Let a map" : £ -)• R d+1 
be defined by e = (a, 6), then it is easy to see that ~ is 
an affine injection and £ is a convex subset in R rf+1 . In 
particular, u = (0, 1) and = (0, 0). 

[Proof of Theorem [T| Let A be an affine functional 
on £ defined by A(e) := A(e). Note that (0, 0) = G £ 
and A(0, 0) = A(0) = 0. Thus, from Proposition [51 A can 
be represented by a vector (x, y) G R d x R = R d+1 such 
that 

A(a, b) = ((x, y), (a, b)) = (x, a) + yb. 

Since 1 = A(u) = A(0, 1), we have y = 1: 

A(o, b) = (x, a) + b. 

In the following, we prove that s := x G S, and thus 
for any effect e G £ with e =: (a, b), we have A(e) = 
A(a, b) = (s,a) + b = e(s). 

To prove this, assume contrary that x $ S, then 
there exists a hyperplane which strictly separates {x} 
and S (see Theorem 4.12 in [23]); Equivalently, there ex- 
ists a (continuous) affinc functional / : R d — 5- R such 
that f(x) < and /(f) > for any f G S- Then, 
f := maxtgs fit) f ^ S an en " GC t such that > f'(x). Note 
that /' can be represented as f'(z) = (do, z)+bo (z G R d ), 
and thus /' = (ao,£>o) from the uniqueness of the repre- 
sentation of effects. Thus, we have 

> f'(x) = (a ,x) +b Q = A(a , b ) = A(f'). 

This contradicts that A(e) > for any e G £ . 

Finally, the uniqueness follows from the separation hy- 
pothesis for states [P2-1]. ■ 
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